STIC Biotechnology Systems Branch

RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form:



Application Serial Number: 
Source:



Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE
APPLICANT WITH A NOTICE TO COMPLY or,
2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A
NOTICE TO COMPLY.

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.2.2 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 

Applicants submitting genetic sequence information electronically on diskette or CD-ROM should be aware that there is
a possibility that the disk/CD-ROM may have been affected by treatment given to all incoming mail.
Please consider using alternate methods of submission for the disk/CD-ROM or replacement disk/CD-ROM.

Any reply including a sequence listing in electronic form should not be sent to the 20231 zip code address for the
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission
User Manual - ePAVE)
2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA
3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE [date illegible]):
U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street,
Alexandria, VA 22314
SEQUENCE LISTING

(1) GENERAL INFORMATION:






Does Not Comply
Corrected Diskette Needed



(i) APPLICANT:

(A) NAME: CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE

(B) STREET: 3, RUE MICHEL-ANGE

(C) CITY: PARIS

(E) COUNTRY: FRANCE

(F) POSTAL CODE: 75794 CEDEX 16

(A) NAME: INSTITUT NATIONAL DE LA SANTE ET DE LA
RECHERCHE MEDICALE

(B) STREET: 101, RUE DE TOLBIAC

(C) CITY: PARIS

(E) COUNTRY: FRANCE

(F) POSTAL CODE: 75654 CEDEX 13



(ii) TITLE OF INVENTION: PEPTIDE ANALOGUES, AND THEIR USES,
IN PARTICULAR IN PHARMACEUTICAL COMPOSITIONS AND FOR DIAGNOSIS

(iii) NUMBER OF SEQUENCES: 4




(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (OEB)



(v) CURRENT APPLICATION DATA:

APPLICATION NUMBER: PCT/FR98/0I

(vi) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: FR 97.05677
(B) FILING DATE: 07-MAY-1998

(2) INFORMATION FOR SEQ ID NO: 1:




(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: [illegible] amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear




(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Ala Ala Gly Ile Leu Thr Val
1               5



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Gly Leu Leu Gly Phe Val Phe Thr Leu
1               5



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ala Val Asp Leu Ser His Phe Leu Lys
1               5



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Ser Leu Tyr Asn Thr Val Ala Thr Leu
1               5
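
The peptide entries above each declare a length and then give the residues as three-letter codes. A minimal Python sketch of that consistency check follows. It is an illustration only and is not the USPTO Checker program; the helper name `check_peptide` and its messages are hypothetical. It flags unknown residue codes (for example a misread such as `He` for `Ile`) and any mismatch between the declared length and the number of residues found.

```python
# Illustrative sketch only; check_peptide is a hypothetical helper,
# not part of any USPTO or PatentIn software.
THREE_LETTER_CODES = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
}


def check_peptide(sequence_text, declared_length=None):
    """Return a list of problems found in a three-letter peptide string."""
    residues = sequence_text.split()
    problems = [
        f"unknown residue code: {code}"
        for code in residues
        if code not in THREE_LETTER_CODES
    ]
    if declared_length is not None and len(residues) != declared_length:
        problems.append(
            f"declared length {declared_length}, found {len(residues)} residues"
        )
    return problems


# The three entries that state a length of 9 amino acids.
entries = {
    2: ("Gly Leu Leu Gly Phe Val Phe Thr Leu", 9),
    3: ("Ala Val Asp Leu Ser His Phe Leu Lys", 9),
    4: ("Ser Leu Tyr Asn Thr Val Ala Thr Leu", 9),
}
for seq_id, (text, length) in entries.items():
    print(seq_id, check_peptide(text, length) or "ok")
```

A check of this kind only tests internal consistency; it says nothing about whether a listing meets the formal requirements of the sequence rules.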



Rules and Regulations



(3) Computer: Apple Macintosh;

(i) Operating System: Macintosh;

(ii) Macintosh File Type: text with line
termination

(iii) Line Terminator: Pre-defined by
text type file;

(iv) Pagination: Pre-defined by text
type file;

(v) End-of-file: Pre-defined by text
type file;

(vi) Media: (A) Diskette, 3.50 inch, 400
Kb storage;

(B) Diskette, 3.50 inch, 800 Kb
storage;

(C) Diskette, 3.50 inch, 1.4 Mb
storage;

(vii) Print Command: Use PRINT
command from any Macintosh
Application that processes text files,
such as MacWrite or TeachText;

(4) Magnetic tape: 0.5 inch, up to 2400
feet;

(i) Density: 1600 or 6250 bits per inch,
9 track;

(ii) Format: raw, unblocked;

(iii) Line Terminator: ASCII Carriage
Return plus optional ASCII Line Feed;

(iv) Pagination: ASCII Form Feed or
Series of Line Terminators;

(v) Print Command (Unix shell version
given here as sample response: mt/
dev/rmt0; lpr/dev/rmt0):
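
The magnetic tape conventions above (ASCII text, a Carriage Return with an optional Line Feed as line terminator, a Form Feed as page break) can be sketched in Python as follows. This is an illustration under stated assumptions: the function name `split_tape_text` is hypothetical, the input is assumed to be the raw bytes already read from the tape, and only the Form Feed style of pagination is handled, not the alternative "series of line terminators".

```python
# Illustrative sketch only; split_tape_text is a hypothetical helper.
FORM_FEED = b"\x0c"
CR = b"\r"
CRLF = b"\r\n"


def split_tape_text(data: bytes):
    """Split raw bytes into pages, each page being a list of text lines."""
    data.decode("ascii")  # raises UnicodeDecodeError on any non-ASCII byte
    pages = []
    for raw_page in data.split(FORM_FEED):
        # Treat "CR LF" and a bare "CR" alike by collapsing CR LF to CR first.
        raw_lines = raw_page.replace(CRLF, CR).split(CR)
        if raw_lines and raw_lines[-1] == b"":
            raw_lines.pop()  # drop the empty piece after a final terminator
        for raw_line in raw_lines:
            if b"\n" in raw_line:
                raise ValueError(
                    "Line Feed found without a preceding Carriage Return"
                )
        pages.append([raw_line.decode("ascii") for raw_line in raw_lines])
    return pages


sample = b"SEQ 1\r\nATCG\r\x0cSEQ 2\rGGCC\r\n"
print(split_tape_text(sample))
```

Rejecting a Line Feed that is not preceded by a Carriage Return is a design choice of this sketch, made because the rule names the Line Feed only as an optional addition to the Carriage Return.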

(g) Computer readable forms that are
submitted to the Office will not be
returned to the applicant.

(h) All computer readable forms shall
have a label permanently affixed thereto
on which has been hand printed or
typed, a description of the format of the
computer readable form as well as the
name of the applicant, the title of the
invention, the date on which the data
were recorded on the computer readable
form and the name and type of computer
and operating system which generated
the files on the computer readable form.
If all of this information cannot be
printed on a label affixed to the
computer readable form, by reason of
size or otherwise, the label shall include
the name of the applicant and the title of
the invention and a reference number,
and the additional information may be
provided on a container for the
computer readable form with the name
of the applicant, the title of the
invention, the reference number and the
additional information affixed to the
container. If the computer readable form
is submitted after the date of filing
under 35 U.S.C. 111, after the date of
entry in the national stage under 35
U.S.C. 371 or after the time of filing, in
the United States Receiving Office, an
international application under the PCT,
the labels mentioned herein must also
include the date of the application and
the application number, including series
code and serial number.

§ 1.825 Amendments to or replacement of
sequence listing and computer readable
copy thereof.

(a) Any amendment to the paper copy
of the "Sequence Listing" (§ 1.821(c))
must be made by the submission of
substitute sheets. Amendments must be
accompanied by a statement that
indicates support for the amendment in
the application, as filed, and a statement
that the substitute sheets include no
new matter. Such a statement must be a
verified statement if made by a person
not registered to practice before the
Office.

(b) Any amendment to the paper copy
of the "Sequence Listing," in accordance
with paragraph (a) of this section, must
be accompanied by a substitute copy of
the computer readable form (§ 1.821(e))
including all previously submitted data
with the amendment incorporated
therein, accompanied by a statement
that the copy in computer readable form
is the same as the substitute copy of the
"Sequence Listing." Such a statement
must be a verified statement if made by
a person not registered to practice
before the Office.

(c) Any appropriate amendments to
the "Sequence Listing" in a patent, e.g.,
by reason of reissue or certificate of
correction, must comply with the
requirements of paragraphs (a) and (b)
of this section.

(d) If, upon receipt, the computer
readable form is found to be damaged or
unreadable, applicant must provide,
within such time as set by the
Commissioner, a substitute copy of the
data in computer readable form
accompanied by a statement that the
substitute data is identical to that
originally filed. Such a statement must
be a verified statement if made by a
person not registered to practice before
the Office.

Appendix A: Sample Sequence Listing
(1) GENERAL INFORMATION:

(i) APPLICANT: Doe, Joan X, Doe, John Q

(ii) TITLE OF INVENTION: Isolation and
Characterization of a Gene Encoding a
Protease from Paramecium sp.

(iii) NUMBER OF SEQUENCES: 2
(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Smith and Jones

(B) STREET: 123 Main Street

(C) CITY: Smalltown

(D) STATE: Anystate

(E) COUNTRY: USA

(F) ZIP: 12345

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette, 3.50 inch, 800
Kb storage

(B) COMPUTER: Apple Macintosh

(C) OPERATING SYSTEM: Macintosh 5.0

(D) SOFTWARE: MacWrite

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 09/999,999

(B) FILING DATE: 25-FEB-1989

(C) CLASSIFICATION: 999/99
(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: PCT/US88/
99999

(B) FILING DATE: 01-MAR-1988

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Smith, John A.

(B) REGISTRATION NUMBER: 00001

(C) REFERENCE/DOCKET NUMBER: 01-
0001

(ix) TELECOMMUNICATION
INFORMATION:
(A) TELEPHONE: (909) 999-0001
(B) TELEFAX: (909) 999-0002

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 954 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA
(iii) HYPOTHETICAL: yes

(iv) ANTI-SENSE: no

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Paramecium sp
(C) INDIVIDUAL/ISOLATE: XYZ2

(G) CELL TYPE: unicellular organism
(vii) IMMEDIATE SOURCE:

(A) LIBRARY: genomic
(B) CLONE: Para-XYZ2/36
(x) PUBLICATION INFORMATION:

(A) AUTHORS: Doe, Joan X, Doe, John Q

(B) TITLE: Isolation and Characterization
of a Gene Encoding a Protease from
Paramecium sp.

(C) JOURNAL: Fictional Genes

(D) VOLUME: I

(E) ISSUE: 1

(F) PAGES: 1-20

(G) DATE: 02-MAR-1988

(K) RELEVANT RESIDUES IN SEQ ID NO:
1: FROM 1 TO 954

BILLING CODE 3510-16-C
(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 82 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(ix) FEATURE:
(A) NAME/KEY: signal sequence
(B) LOCATION: -34 to -1

(C) IDENTIFICATION METHOD: similarity
to other signal sequences, hydrophobic
(D) OTHER INFORMATION: expresses
protease

(x) PUBLICATION INFORMATION:
(A) AUTHORS: Doe, Joan X, Doe, John Q
(B) TITLE: Isolation and Characterization
of a Gene Encoding a Protease from
Paramecium sp.

(C) JOURNAL: Fictional Genes
(D) VOLUME: I

(E) ISSUE: 1

(F) PAGES: 1-20
(G) DATE: 02-MAR-1988
(K) RELEVANT RESIDUES IN SEQ ID NO:
2: FROM -34 TO 48
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATCGGGATAG TACTGGTCAA GACCGGTGGA CACCGGTTAA CCCCGGTTAA GTACCGGTTA  60

TAGGCCATTT CAGGCCAAAT GTGCCCAACT ACGCCAATTG TTTTGCCAAC GGCCAACGTT  120

ACGTTCGTAC GCACGTATGT ACCTAGGTAC TTACGGACGT GACTACGGAC ACTTCCGTAC  180

GTACGTACGT TTACGTACCC ATCCCAACGT AACCACAGTG TGGTCGCAGT GTCCCAGTGT  240

ACACAGACTG CCAGACATTC TTCACAGACA CCCC ATG ACA CCA CCT GAA CGT CTC  295
                                      Met Thr Pro Pro Glu Arg Leu
                                                      -30

TTC CTC CCA AGG GTG TGT GGC ACC ACC CTA CAC CTC CTC CTT CTG GGG  343
Phe Leu Pro Arg Val Cys Gly Thr Thr Leu His Leu Leu Leu Leu Gly
        -25                 -20                 -15

CTG CTG CTG GTT CTG CTG CCT GGG GCC CAT GTGAGGCAGC AGGAGAATGG  393
Leu Leu Leu Val Leu Leu Pro Gly Ala His
    -10                 -5

GGTGGCTCAG CCAAACCTTG AGCCCTAGAG CCCCCCTCAA CTCTGTTCTC CTAG GGG  450
                                                            Gly

CTC ATG CAT CTT GCC CAC AGC AAC CTC AAA CCT GCT GCT CAC CTC ATT  498
Leu Met His Leu Ala His Ser Asn Leu Lys Pro Ala Ala His Leu Ile
1               5                   10                  15

CTAAACATCC ACCTGACCTC CCAGACATGT CCCCACCAGC TCTCCTCCTA CCCCTGCCTC  558

AGGAACCCAA GCATCCACCC CTCTCCCCCA ACTTCCCCCA CGCTAAAAAA AACAGAGGGA  618

GCCCACTCCT ATGCCTCCCC CTGCCATCCC CCAGGAACTC AGTTGTTCAG TGCCCACTTC  678

TAC CCC AGC AAG CAG AAC TCA CTG CTC TGG AGA GCA AAC ACG GAC CGT  726
Tyr Pro Ser Lys Gln Asn Ser Leu Leu Trp Arg Ala Asn Thr Asp Arg
            20                  25                  30

GCC TTC CTC CAG GAT GGT TTC TCC TTG AGC AAC AAT TCT CTC CTG GTC  774
Ala Phe Leu Gln Asp Gly Phe Ser Leu Ser Asn Asn Ser Leu Leu Val
        35                  40                  45

TAGAAAAAAT AATTGATTTC AAGACCTTCT CCCCATTCTG CCTCCATTCT GACCATTTCA  834

GGGGTCGTCA CCACCTCTCC TTTGGCCATT CCAACAGCTC AAGTCTTCCC TGATCAAGTC  894

ACCGGAGCTT TCAAAGAAGG AATTCTAGGC ATCCCAGGGG ACCCACACCT CCCTGAACCA  954
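
Each row of the nucleotide listing above ends with a running total of bases. A minimal Python sketch of how those totals could be re-counted follows; it is an illustration only, the helper name `check_running_totals` is hypothetical, and it handles nucleotide rows only (not the amino acid or numbering rows). The example uses the three non-coding rows that end at 558, 618 and 678, and assumes 498 bases precede them (450 at the end of the intron row plus the 16 codons of the next coding row).

```python
# Illustrative sketch only; check_running_totals is a hypothetical helper.
def check_running_totals(rows, start=0):
    """Re-count bases row by row and compare with the printed totals."""
    total = start
    problems = []
    for row in rows:
        tokens = row.split()
        if not tokens or not tokens[-1].isdigit():
            problems.append(f"no printed total in row: {row!r}")
            continue
        *groups, printed = tokens
        bad_groups = [g for g in groups if set(g) - set("ACGT")]
        if bad_groups:
            problems.append(f"non-ACGT characters in {bad_groups}")
        total += sum(len(g) for g in groups)
        if int(printed) != total:
            problems.append(f"printed {printed}, counted {total}")
    return problems


rows = [
    "CTAAACATCC ACCTGACCTC CCAGACATGT CCCCACCAGC TCTCCTCCTA CCCCTGCCTC 558",
    "AGGAACCCAA GCATCCACCC CTCTCCCCCA ACTTCCCCCA CGCTAAAAAA AACAGAGGGA 618",
    "GCCCACTCCT ATGCCTCCCC CTGCCATCCC CCAGGAACTC AGTTGTTCAG TGCCCACTTC 678",
]
print(check_running_totals(rows, start=498) or "totals agree")
```

A total that was split or misread during scanning shows up here as a stray non-ACGT token and a count mismatch on the same row.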

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Thr Pro Pro Glu Arg Leu Phe Leu Pro Arg Val Cys Gly Thr Thr
                -30                 -25                 -20

Leu His Leu Leu Leu Leu Gly Leu Leu Leu Val Leu Leu Pro Gly Ala
            -15                 -10                 -5

His Gly Leu Met His Leu Ala His Ser Asn Leu Lys Pro Ala Ala His
        1               5                   10

Leu Ile Tyr Pro Ser Lys Gln Asn Ser Leu Leu Trp Arg Ala Asn Thr
15                  20                  25                  30

Asp Arg Ala Phe Leu Gln Asp Gly Phe Ser Leu Ser Asn Asn Ser Leu
                35                  40                  45

Leu Val
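
The sample protein above is the translation of the coding rows of SEQ ID NO: 1. A minimal Python sketch of that relationship follows, assuming the standard genetic code; the function name `translate` is hypothetical and the two codon rows are copied from the rows of the nucleotide listing that end at 726 and 774. Translating them should reproduce the residues printed beneath those rows.

```python
# Illustrative sketch only, assuming the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon
}


def translate(codon_row):
    """Translate space-separated DNA codons into three-letter residue codes."""
    return " ".join(ONE_TO_THREE[CODON_TABLE[c]] for c in codon_row.split())


row_726 = "TAC CCC AGC AAG CAG AAC TCA CTG CTC TGG AGA GCA AAC ACG GAC CGT"
row_774 = "GCC TTC CTC CAG GAT GGT TTC TCC TTG AGC AAC AAT TCT CTC CTG GTC"
print(translate(row_726))
print(translate(row_774))
```

The same comparison, run over every coding row, gives a quick way to spot a codon or residue that was misread in scanning.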




